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Abstract 

The benefit of multi-anteima receivers is investigated in wireless ad hoc networks, and the main finding is that 
network throughput can be made to scale linearly with the number of receive antennas n„ even if each transmitting 
node uses only a single antenna. This is in contrast to a large body of prior work in single-user, multiuser, and ad 
hoc wireless networks that have shown linear scaUng is achievable when multiple receive and transmit antennas (i.e., 
MIMO transmission) are employed, but that throughput increases logarithmically or sublinearly with when only 
a single transmit antenna (i.e., SIMO transmission) is used. The linear gain is achieved by using the receive degrees 
of freedom to simultaneously suppress interference and increase the power of the desired signal, and exploiting the 
subsequent performance benefit to increase the density of simultaneous transmissions instead of the transmission 
rate. This result is proven in the transmission capacity framework, which presumes single-hop transmissions in the 
presence of randomly located interferers, but it is also illustrated that the result holds imder several relaxations of 
the model, including imperfect channel knowledge, multihop transmission, and regular networks (i.e., interferers 
are deterministically located on grids). 

I. Introduction 

Multiple antenna communication has become a key component of virtually every contemporary high- 
rate wireless standard (LTE, 802.1 In, WiMAX). The theoretical result that sparked the intense academic 
and industrial investigation of MIMO (multiple-input/multiple-output) communication was the finding that 
the achievable throughput of a point-to-point MIMO channel scales linearly with the minimum of the 
number of transmit and receive antennas [2], [3]. Linear scaling in the number of transmit antennas can 
also be achieved in point-to-multipoint (broadcast/downlink) channels [4] even if each receiver has only 
a single antenna, or in the number of receive antennas in multipoint-to-point (multiple access/uplink) 
channels even if each transmitter has a single antenna [3]. In these two cases, the linear gains are enabled 
by the simultaneous transmission or reception of multiple data streams. 

A. Overview of Main Results 

In this paper we are interested in the throughput gains that multiple antennas can provide in ad hoc 
networks, rather than in channels with a common transmitter and/or receiver. If multiple antennas are 
added at each node in the network and point-to-point MIMO techniques are used to increase the rate of 
every individual link (i.e., every hop in a multi-hop route) in the network, then network-wide throughput 
naturally increases linearly with the number of antennas per node. Similarly, based on the quoted point- 
to-multipoint and multipoint-to-point MIMO results, linear scaling is also expected to occur if nodes are 
capable of sending or receiving multiple streams. The main finding of this paper is that network-wide 
throughput can be increased linearly with the number of receive antennas, even if only a single transmit 
antenna is used by each node (i.e., single-input, multiple-output, or SIMO, communication), and each 
node sends and receives only a single data stream. Furthermore, this gain is achievable using only linear 
receive processing and does not require any transmit channel state information (CSIT). 

The main result is obtained by considering an ad hoc network in which transmitters are randomly located 
on the plane according to a 2-D homogeneous Poisson point process with a particular spatial density. 

N. Jindal is witti tlie Univ. of Minnesota, J. Andrews is with ttie Univ. of Texas at Austin, S. Weber is with Drexel Univ. The contact 
author is N. Jindal, nihar@umn.edu. Preliminary results appeared at ICC 2009 [1]. 
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We consider a desired transmit-receive pair separated by a fixed distance, and experiencing interference 
(assumed to be treated as noise) from all other active transmitters. The received signal and interference are 
functions of path-loss attenuation and fading, and we assume that for a transmission to be successful, it 
must be detected with an SINR larger than a defined threshold p. The primary performance metric is the 
maximum spatial density of transmitters/interferers that can be supported such that the outage probability 
P[SINR < P] is no larger than an outage constraint e, and our particular interest is in quantifying how 
quickly this maximum density increases with the number of receive antennas np, (with /? and e fixed). 

If receive beamforming is performed, the receive degrees of freedom can be used to either increase the 
power of the desired signal (i.e., for array gain) or suppress interference, and these competing objectives 
are optimally balanced by the SINR-maximizing MMSE (minimum mean-square error) filter. Using the 
MMSE outage probability lower bound in [5], we develop an upper bound on the maximum density 
allowable with an MMSE receiver. In conjunction, we develop a density lower bound by analyzing the 
performance of a novel suboptimal partial zero forcing (PZF) receiver, which uses an explicit fraction of 
the degrees of freedom for array gain and the remainder for interference cancellation. By showing that 
both the lower and upper bound are linear in ^a, we can conclude that the optimum transmit density is 
9(nR), and we demonstrate that this allows well known metrics like the transmission capacity [6], [7], 
transport capacity [8], [9], and the expected forward progress [10] to all increase as ©(np) as well. 



B. Related Work 

In addition to the large body of work on point-to-point and multiuser MIMO systems, this paper is also 
related to several prior works that have studied the use of multiple receive antennas in ad hoc networks 
with Poisson distributed transmitters. References [11] and [12] considered precisely the same model as 
this paper, but studied the performance of slightly different receiver designs that turn out to yield very 
different performance. In [11] the receive filter is chosen according to the maximal ratio criterion and 
thus only provides array gairQ while [12] considers the other extreme where the antennas are used 
to cancel the strongest tir — 1 interferers but no array gain is obtained. Both receiver designs achieve 
only a sublinear density increase with ^r, at growth rates t^r^^" and tIr^^^/", respectively. On the other 
hand, we show that linear density scaling is achieved if a fraction of the receive degrees of freedom are 
used for array gain and the other fraction for interference cancellation, or an MMSE receiver is used. In 
[13] the performance of the MMSE receiver is investigated for a network in which the density is fixed 
and additional receive antennas are used to increase the per-link SINR/rate. It is shown that the average 
per- link SINR increases with tIr as /Ir"/^, where a is the path-loss exponent, which translates into only 
a logarithmic increase in per-link rate and overall system throughput. In contrast, we study the rather 
different setting in which the per-link rate is fixed and the density increases with tir, and we show that 
exploiting the antennas to increase density instead of rate provides a significantly larger (i.e., linear versus 
logarithmic in tIr) end-to-end benefit (c.f. Section IV-B| ). 



Early work on characterizing the throughput gains from MIMO in ad hoc networks includes [14]- 
[17] although these generally primarily employed simulations, while more recently [18]-[20] used tools 
similar to those used in the paper and developed by the present authors. However, none of these works 
have characterized the maximum throughput gains achievable with receiver processing only. 



The remainder of the paper is organized as follows. The system model and key metrics are described 
in Section |ll| The main results are derived in Section |Ill[ and then various extensions and relaxations of 
the model are considered in Section IV in all of these diverse permutations we observe that the linear 
scaling result still holds. We conclude in Section |V| 



'Although reference [11] also considers strategies involving multiple transmit antennas, here we have mentioned only the directly relevant 
single transmit/multiple-receive scenario results. 



3 



TXl 



TXO 



TX3 



TX2 




TX4 



Interferers 



Fig. 1. Example transmit-receive pair with four nearby interferers shown. In addition to the distances d (desired) and Xi (interferer i) 
shown, there is a fading value hi. 



II. System Model and Metrics 

We consider a network in which the set of active transmitters are located according to a 2-D homoge- 
neous Poisson point process (PPP) of density A (transmitters/m^). Each transmitter communicates with a 
receiver a distance d meters away from it, where it is assumed that each receiver is randomly located on 
a circle of radius d centered around its associated transmitter. Note that the receivers are not a part of the 
transmitter PPP. Each transmitter uses only one antenna, while each receiver has antennas. This setup 
is depicted in Fig. [1} The Poisson model is reasonable for uncoordinated networks, such as those using 

alohaE] 

By the stationarity of the Poisson process we can consider the performance of an arbitrary TX-RX pair, 
which we refer to as TXO and RXO. From the perspective of RXO, the set of interferers (which is the entire 
transmit process with the exception of TXO) also form a homogeneous PPP due to Slivnyak's Theorem; 
see [7] for additional discussion of this point and further explanation of the basic model. As a result, 
network-wide performance is characterized by the performance of a single TX-RX pair, separated by d 
meters and surrounded by an infinite number of interferers located on the infinite 2-D plane according to 
a homogeneous PPP with density A interferers/m^. 

Assuming a path-loss exponent of a (a > 2) and a frequency-flat channel, the nR-dimensional received 
signal Yo at RXO is given by: 

yo = rf-"/Xno+ Yl l^r"^'h,M, + z (1) 
ien(A) 

where \Xi\ is the distance to the i-th transmitter/interferer (as determined by the realization of the PPP), 
hj G C"^^^ is the vector channel from the i-th transmitter to RXO, z G C"^^^ is complex Gaussian noise 
with CO variance r/I, and Ui is the power-p symbol transmitted by the i-th transmitter. Consistent with a 
rich scattering environment, we assume that each of the vector channels hi have iid unit-variance complex 
Gaussian components, independent across transmitters. 



^In Section IV-C we briefly examine a regular interferer geometry that is a more appropriate model for networks employing more 
sophisticated random access techniques. 
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A. Performance Metrics 

If a unit norm receive filter vq is used, the resulting signal-to-interference-and-noise ratio is: 

SINR= Pd->'M' . (2) 

^ + Eien(A)Pl^nvSh,|2 

Without loss of generality, we assume that the distances \Xi\ are in increasing order in order to take 
advantage of the property that the ordered squared-distances |Xip, IX2P, . . . follow a 1-D Poisson point 
process with intensity vrA [21]. To simplify notation we define the constant SNR = as the interference- 
free signal-to-noise ratio, which allows us to write: 

SINK = ^^'^ , . (3) 

SNR ^ " Z^ien(A) ro^^«l 

The received SINR depends on the interferer locations and the vector channels, both of which are 
random. The outage probabilit)]^ with respective to a pre-defined SINR threshold {3 is: 

Pout(A) =P[SINR</5], (4) 

which clearly is increasing in A. It is often desirable from a system perspective to maintain a constant 
outage level e (e.g., to ensure that higher-layer reliability mechanisms are appropriately utilized), and thus 
the performance metric of interest is A^, the maximum interferer density such that the outage does not 
exceed e: 

K = max{A : Pout(A) < e}. (5) 

A 

The transmission capacity is the number of successful transmissions per unit area, and can be defined 
as Ae(l — e)b, where b = log2(l + is the data rate assuming a good channel code; this definition is akin 
to area spectral efficiency (ASE). Discus sion of how this metric translates to end-to-end metrics such as 
transport capacity is provided in Section IV-B|'' 



B. Receive Filters 

The SINR and maximum density depend critically on the receive filter that is used. The receive filter 
can be used to either boost the power of the desired signal (by choosing vq in the direction of hg) or 
to cancel interference, or some combination of the two. In this paper we consider the MMSE receiver, 
which optimally balances signal boosting and interference cancellation and maximizes the SINR, as well 
as a sub-optimal partial zero-forcing receiver, which uses a specified number of degrees of freedom for 
signal boosting and the remainder for cancellation. We assume that the receive filter is chosen based upon 
knowledge of the signal channel hg and the interfering channels {hj},^j^; this optimistic assumption of 



perfect receiver channel state information (CSI) is scrutinized in Section IV-A 

MMSE Receiver. From basic results in estimation theory, the MMSE receive filter is given, in unnor- 
malized form, by: 

vo = S^'ho. (6) 
where S is the spatial covariance of the interference plus noise 

5^ = S^I + ^" E l^r"h.ht. (7) 

ien(A) 

^The implicit assumption is that tlie channels and the set of active interferers are constant for the duration of a packet transmission but 
generally vary across transmissions. As a result, the outage probability accurate approximates the packet error probability experienced by 
each node; by the stationarity of the process, it also approximates the network-wide packet error probability. 

^Although Ac depends on the design parameters e, /3, and ub, in this work we are interested in the behavior of Ae with respect to rin 
while the other parameters are kept fixed. (A justification for keeping e fixed has already been put forth, and justification for not increasing 



f3 is provided in Section IV-B ) Thus, we henceforth denote Xc as a function of Wr. 
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Note that S is the covariance matrix conditioned on the interferer channels and distances {hj}^]^ and 
Amongst the set of all possible receive filters vq, the MMSE filter maximizes the received 
SINR. Its corresponding value is: 

gj^j^mmse _ hj^-^ho. (8) 

The corresponding outage probability and maximum density are denoted as PJ^™*''^(A) and Xf'^^^^nj^). 

Partial Zero Forcing Receiver. We also study a suboptimal receiver that explicitly cancels interference 
from nearby transmitters while using the remaining degrees of freedom to boost the power of the desired 
signal. More specifically, the filter vq is chosen orthogonal to the channel vectors of the k nearest interferers 
hi, . . . , hfc. The parameter k must be an integer and satisfy k < n^ — l|^ Amongst the filters satisfying 
the orthogonality requirement |vjhjp = for i = 1, . . . , A;, we are interested in the one that maximizes 
the desired signal power |vJhop. By simple geometry, this corresponds to choosing vq in the direction 
of the projection of vector hg on the nuUspace of vectors (hi, . . . , h^). More precisely, if the columns of 
the riR X — k) matrix Q form an orthonormal basis for the nuUspace of (hi, . . . , h^), then the receive 
filter is chosen as: 

Q^ho 
IIQ^holl 

The corresponding outage probability and maximum density are denoted Pout '^( A) and \^^^^^{n^), re- 
spectively. Notice that A; = and /c = rij, — 1 correspond to the extremes of MRC (vq = ho/||ho||) and 
interference cancellation of the maximum number of interferers (full zero forcing), respectively. Because 
the MMSE receiver is SINR-maximizing, we clearly have 

Pour(A) < P^u[-'(A) and X^'^n^) > \f-\n^) (10) 

for all k and any set of system parameters. 

Although suboptimal, it is beneficial to study the PZF receiver because it is more amenable to analysis 
than the MMSE filter and because its simple structure allows us to clearly understand why linear scaling 
is achievable. Note, however, that an MMSE filter should be used in practice because it also is linear and 
its CSI requirements are less stringent (PZF requires knowledge of individual interferer channels, whereas 
MMSE only requires knowledge of the aggregate interference). 



in. Main Results: Density Scaling with Receive Antennas 

In this section we prove the main result of the paper, which is that A™™'*''(nR) and X^^^~^{nYi) both 
increase linearly with rip. We prove this result in two parts: we first show that a lower bound to Ap^^~'^(?2r), 
and thus to Xf^'^^ijij^), increases linearly with tIr, and then show that an upper bound on \f™^^{n^) also 
increases linearly with Wr. 



A. Lower Bound: Achievability of Linear Scaling with Partial Zero Forcing 

First, we show that linear scaling is achievable by finding a lower bound on AP^^^^(nR) that is linear in 
?7,R. In order to develop the bound, we first statistically characterize the signal and interference coefficients 
when the PZF receiver is used. These characterizations are a consequence of the basic result that the 
squared-norm of the projection of a tir -dimensional vector with iid unit-variance complex Gaussian 
components onto an independent s-dimensional subspace is [22] |^ 

^Although performance could conceivably be improved by choosing k on the basis of the channel realizations, it is sufficient for our 
purposes to choose k in an offline fashion. The value of k is left unspecified for the time being. 

^Throughout the paper we abide by communications literature convention and define a xis random variable to have PDF f{x) — ^ ^^_\y ' 
which may differ slightly from the definition in probability literature. 
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We denote the signal and interference coefficients as 

S ^ IvJhoP (11) 



Hi ^ Iv^hil^ 1 = 1,2,... (12) 

and characterize the statistics of these coefficients in the following lemma. 

Lemma 1: For PZF-A;, the signal coefficient S is X2(nR-fc)' interference terms Hi, . . . , Hk are zero, 
and coefficients H^^i, Hk+2, • • • are iid unit-mean exponential (i.e., xh- Furthermore, S, H^+i, • • • 
are mutually independent. 

Proof: See Appendix |l} ■ 



Using this statistical characterization and the definitions in (11 )-(12) the received SINR is 

SNR ^ " l^i=k+l 

where the 5* and Hi terms are characterized in Lemma [T| the quantities |Xfc+ip, |Xfc+2p, • • • are the 
k-\-l,k + 2, . . . ordered points of a 1-D PPP with intensity vrA, and all random variables are independent. 
The aggregate interference power for PZF-A; is denoted as: 

oo 

h^^Y, l^r"^- (14) 

i=k+l 

and the expectation of this interference power is characterized in the following lemma: 
Lemma 2: For k > f — 1, the expected interference power is characterized as: 

EI4I = KA)» Y. < M^^)' (f - - [f ]) . (15) 

i=k+l ^ ' 

where r(-) is the gamma function and [■] is the ceiling function, with the upper bound valid for k > [|] . 
Proof: See Appendix [llj ■ 

To derive the main result for PZF, we use Lemma [2] to upper bound E[l/SINRP"^-'=] and then combine 
this with Markov's inequality to reach an outage probability upper bound, and, inversely, a density lower 
bound: 

Theorem 1: The outage probability with PZF-A; is upper bounded by: 

P^ul A < ^ — ^ (16) 

Uj^-k-l 

for [|] < A; < — L In turn, the maximum density Ap^'^''(?7.j^) is lower bounded by: 



a 



(17) 



for any k satisfying [|] < k < — I 



/3 



t SNR- 
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Proof: The outage upper bound is derived by rewriting the outage probability as the tail probability 
of random variable 1/SINR and then applying Markov's inequality as follows: 



iPZF- 



P 



SINRP'*' 



1 

-r> - 
- j3 



(a) 
< 



(b) 



< 



1 



P 
(3 



E 



1 



4 + 



SNR 





'1" 


E 




S_ 



. 2 



+ 



1 

SNR 



k-1 



(18) 



(19) 



(20) 



(21) 



where (a) is due to Markov's inequality, (b) is due to (13) the independence of and 5*, and (c) follows 
from Lemma jlj and because S is xlin^-k) ^[VX2«] = — 1) for / > 1. Setting this bound equal 
to e and then solving for A yields the associated lower bound to A^. ■ 



It is worthwhile to note that the (rip 
increase due to array gain (i.e 



gj^Pj^) ° term in the Ae lower bound is the density 
increased signal power), while the [k '"'^"'^^ 



[|]) " term is the density 
increase due to interference cancellation. Thus, the bound succintly illustrates the tradeoff between array 
gain and interference cancellation. 

In order to show the achievability of linear scaling, we need only appropriately increase k with tIr . If 
we choose the number of cancelled interferers k = 6n^ for some constant < 6^ < 1, the density lower 
bound becomes: 



TT 



rf2 



1 + 



SNR 



1-9 



0- 



1-^ 



(22) 



Because the conditions for Theorem [T] are satisfied for sufficiently large if k = On^^ with < < 1 
(for any e > and SNR > /? > 0), the lower bound scales linearly with n^. This result is formally stated 
as follows: 



Lemma 3: For any 6 satisfying < < 1, 

Af^-^"R(nR) 



> 



/3 



TC 



(23) 



for sufficiently large np . 

This perhaps surprising scaling result can be intuitively understood by examining how the signal and 
aggregate interference power increase with n^. Choosing 6 < 1 ensures that the signal power, which is 

we can see that the condition 



"A is linear in rij, . These linear 



^2(i-6i)nH' increases linearly with n^. Based on the upper bound in Lemma 
6* > ensures that the interference power increases only linearly with n^. 
terms are offsetting, and thus allow an approximately constant SINR to be maintained as A is increased 
linearly with n^. 

On the other hand, linear scaling does not occur if k is kept constant. Specifically, \i k = k for some 
constant k, then the signal power increases linearly with as desired but the interference power increases 
too quickly with the density (as A"/^), thereby limiting the density growth to tIr^/". Linear scaling also 
does not hold at the other extreme where all but a fixed number of degrees of freedom are used for 
cancellation: if k = n^^ — k for some constant k, then the interference power scales appropriately with 
the density but the signal power is xL ^nd thus does not increase with tir, thereby limiting the density 



mcrease to n, 



1-2/q 



. These results are consistent with the findings of [11] and [12]. 
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B. Upper Bound: The MMSE Receiver 

While the earlier result showed that a lower bound to Af™'*'^(?2R) scales linearly with tir, we make our 
scaling characterization more precise by finding an upper bound to Af™'*'^(nR) that also scales linearly 
with n-n. In order to obtain such a bound, we utilize the MMSE performance outage probability lower 
bound from [5]. Extended to the model in this paper, the bound states: 





|ho 


12 


\--\oo 


X^\ 


-''Hi 



</3 



(24) 



where ho and |Xi|, IX2I, . . . are defined as before, and the random variables Hi,H2, ■ ■ ■ are iid, unit-mean 
exponential random variables (independent of all other random variables). The bound in [5] is for fixed 
interferer distances and no thermal noise. However, by averaging over the interferer locations (according 
to the PPP) and using the fact that outage probability is increasing in the noise power i], we obtain ([24]) 
and see that it also holds in the presence of noise. 

The SIR expression in (24) is closely related to the SINR characterization for PZF-A; in (13). The 
denominator of the SIR in (24) is precisely as if a PZF receiver with A; = — 1 is used (i.e., the nearest 
— 1 interferers are cancelled, and the effective fading coefficients from the uncancelled interferers are 
iid exponential), while the numerator corresponds to PZF with A; = 0. Thus, the bound in (24) corresponds 
to an idealized setting where the receive filter cancels the nearest — 1 interferers but still is in the 
direction of ho. 

We translate this outage lower bound into a density upper bound in a manner that is complementary 
to Theorem [T] we upper bound the expected SIR and then apply Markov's inequaltiy to the success 
probability to obtain the following result: 

Theorem 2: The outage probability with an MMSE receiver is lower bounded by: 



a/2 



P-r(A) > 1 - ^ 1^ "' %^' ' ' ) (25) 

and, in turn, X'^^^^^n^) is upper bounded by: 

, , 2nR + 1 + f 
^-(n.) < ,,.,3°,„(i (26) 

Proof: See Appendix |lllj ■ 
This upper bound, which by ([10]) also applies to PZF, scales linearly with tIr. Combining Theorems 1 
and 2, it is clear that A'™'^'^(nR) and Ap^'(?7.r) both scale linearly with tIr, although numerical results in 



III-E will confirm that MMSE is better by a non-negligible constant factor. 



Because the SINR with a PZF receiver differs only slightly from the expression in ( [24] ), we can use 
precisely the argument of Theorem [2] to derive the following upper bound on AP^*~^(r;,R) that, unlike 
Theorem [T| applies for any < k < — 1: 

This allows us to upper bound the performance if MRC or full zero-forcing is used. For A; = (MRC), 
if we choose / = 2 the upper bound becomes 

while for A; = ?7,R — 1 (full zero-forcing) we choose / = Wr + 1 to get 

Xv-Un - IX 2 + a/(2nR) 

For MRC the upper bound is 0{n^l'^') while for full zero-forcing it is 0{n^~'^l°''). These upper bounds 
complement the matching lower bounds in [11] and [12], respectively. 
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C. Array Gain v. Interference Cancellation 

Because the MMSE receiver implicitly balances, through ([6]), array gain and interference cancellation, it 
is not evident how the MMSE utilizes the receive degrees of freedom. If the eigenvalues of the interference 
covariance S are roughly equal then the MMSE filter is nearly in the direction of ho; on the other hand, if 
the eigenvalues are very disparate then the MMSE filter is (approximately) in the direction of the projection 
of ho on the subspace orthogonal to the directions of the strong interfering eigenmodes. Thus, the fraction 
of degrees of freedom used for array gain instead of interference cancellation depends critically on the 
spread of the eigenvalues of S, which turns out to depend on the path loss exponent a. 

The MMSE receiver can be understood by studying the PZF receiver, and in particular by finding the 
value of 6 that maximizes the PZF density. Based on Lemma [3] it is clear that the PZF density lower 
bound depends on only through the term (1 — 9)a0^~a for large ^r. To determine the dependence of 
the PZF upper bound in ( |27| ), we first minimize the bound with respect to /, for large Nr and k = 6n^. A 
simple calculation finds that / = (^ ^^^^ ^ dn^ is the minimizer, and with this choice of / the dependence 

of the density upper bound also occurs only through the term (1 — 6)^6^~a. 

Thus, the upper and lower bounds depend on 9 only through the term (1 — 9)a9^~a. By taking the 
derivative (w.r.t. 9) and solving, we find that the maximizing value of 6' is: 

r = 1 - -. (30) 

a 

As a ^ 2 the degrees of freedom should be used to boost signal power rather than to cancel interference 
(i.e., 9* 0), because far-away interference is significant and so cancelling a few nearby interferers 
provides a smaller benefit than using the antennas for array gain. At the other extreme, 6** — > 1 as the 
path loss exponent increases because the power from nearby interferers begins to dominate and thus the 
antennas are more profitably used for interference cancellation than array gain. 
As it turns out, the optimizing va 

receiver. Fig. |2j contains a plot of E 



ue 9* and the above intuition are also consistent with the MMSE 



Ivjhol 



|Vo| 



the expectation of the squared correlation 



between the normalized MMSE filter and channel vector, versus a for ripj = 8. This quantity effectively 
measures the fraction of degrees of freedom used for array gain. For the PZF receiver this metric is 
precisely equal to nj, — k, and thus would be equal to 1 — 9* = 2/ a if k was chosen as k = 9*n^. 
Although the MMSE receiver implicitly balances array gain and cancellation (as opposed to the explicit 
balance for the PZF receiver), the plot shows that the MMSE receiver also utilizes (approximately) a 
fraction 2/ a of its receive degrees of freedom for array gain. Similar to the intuition stated for the 
behavior of 9* for PZF, the eigenvalues of S become more disparate as a is increased, and the MMSE 
receiver takes advantage of this by performing more interference cancellation (and thus providing less 
array gain) when a is larger. 

D. Improved Lower and Upper Bounds 

Although the bounds developed in Sec. |III-A and III-B are sufficient to show linear scaling, both of 



them are quite loose. This looseness is primarily due to the use of Markov's inequality, and the following 
more accurate bounds are instead derived through application of Chebychev's inequality: 
Theorem 3: The outage probability for the PZF filter is upper bounded by 



/•oo 1 

P!:t\X) <¥{S<a*)+ f3'V^T^\h) / -2fs{s)ds. 



(31) 



where S ~ x\n^-ky h is defined in (\A), a* = (3 (^[h] + + /va^^^(4)^ , and 



oo oo I oo 



V^i-\h) = {'Kd'\r{ J2 (E[7;-"]+Var(7;-^))+2 V (t^"^) V (t;" 



\i=k+l i=k+l j=i+l 
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Fig. 2. Average squared correlation between signal channel ho and MMSE filter as a function of path loss exponent a. Also shown is the 
approximation 2/a. 



Proof: See Appendix |IV| 
Theorem 4: The outage probability for the MMSE filter is lower bounded by 



where 5* ~ X2nn' ^^-r defined in (14), and 

oo 



nR,(nR, + 1) 



nR 



(32) 



(33) 



and 7 is the Euler-Mascheroni constant and ^/'o(^) is the poly-Gamma function. 

Proof: See Appendix |Vj ■ 

In the two theorems the random variables Tj are each chi-square with 2i degrees of freedom, and have 
moments characterized by: 

2 

, if i + 6 > 0. (34) 



Ti) 



T i) 



By equating these bounds to e and solving (numerically) for A, the PZF and MMSE densities can be 
lower and upper bounded, respectively. 
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Fig. 3. Maximum density versus tir for e — .\, — \, a = d — \, d — \. 



E. Numerical Results 

In Figures |3] and |4| the numerically computed maximum densities for the MMSE receiver Af™''^(?2R) and 
the PZF receiver X^'^in^^) with k = 9*n^ are plotted on a log- log scale versus for a = 3 and a = 4, 
along with the PZF lower bounds (from Theorems [T] and [3]), the MMSE upper bounds (from Theorems 
[2] and [4]), and the densities for MRC and full zero-forcing {X^^^'^iriYt) with A; = and k = n^t — 1, 
respectively). In each plot, the tighter of the upper and the tighter of the lower bounds correspond to 
the Chebychev-based bounds in the previous section. The bounds and numerically computed densities are 
representative of the linearly increasing density for PZF and MMSE, whereas MRC and full zero forcing 
both exhibit much poorer scaling. Figures [5] and |6] provide linear plots of the maximum density versus 
for more realistic numbers of antennas. Even a few antennas allow for very large density gains, and 
thus the asymptotic scaling results also are indicative of performance for small values of n^. The plots 
also make it clear that MMSE and PZF are strongly preferred to MRC or full zero forcing, and also that 
a non-negligible benefit is afforded by using the optimal MMSE filter rather than PZF. 



IV. Generalizations and Extensions of the Model 

In this section, we explore three of the potentially controversial aspects of our model to show that the 
linear scaling result is not an artifact of our model and assumptions: we remove the assumption of perfect 
CSI at the receiver, we evaluate the benefit of antennas from an end-to-end perspective, and we consider 
the importance of the interferer geometry. 
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A. Ejfect of Imperfect CSI 

The objective of this section is to illustrate that reasonable performance is achieved even if the receiver 
has to estimate the CSI, instead of assuming this information is a priori provided to the receiver. In order 
to design the optimal MMSE filter, the receiver requires an estimate of ho, the signal channel, and S, 
the interference (plus noise) covariance. The desired channel can be estimated at the receiver via pilot 
symbols and the effects of such training error are well understood [23]. On the other hand, it is not as 
clear how the receiver can estimate S and what effect estimation error has on performance. 

Recall that the covariance S depends on the interferer locations, and thus on the active interferers, 
as well as the instantaneous channel realizations. Although coordinated transmission of pilots seems 
infeasible in a decentralized network, the receiver can estimate the covariance by listening to interferer 
transmissions, in the absence of desired signal. If the desired transmitter remains quiet for K symbols, 
the receiver can use the K observations of noise plus interference to form the sample covariance 

^^^Er.rl (35) 

i=l 

where rj represents the i-th observation of the noise plus interference. Assuming, for simplicity, knowledge 
of ho, the receiver can then use the filter S ho and the corresponding SINR is 

(hJS^'ho)' 

SINR = (36) 

hJS EE ho 

This SINR was analyzed in [24] assuming that all interferers transmit independent Gaussian symbols, and 
it was shown that for every S, the expected SINR using filter S ho (expectation w.r.t the distribution 
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of S), and the SINR using the correct fiker S ho are related according to: 

- 1 



hSs ho 



hl± "^SS ^ho 



K 



hJS-^ho, (37) 



where from (jsj), hjs^^ho is the SINR when the proper MMSE filter is used. By taking an additional 
expectation with respect to S (which is determined by the interferer locations and channels), we see 
that the expected SINR using an MMSE filter based upon the sample covariance S is precisely a factor 
of 1 — ^^B^ smaller than the expected SINR with perfect knowledge of S. As expected, this factor is 
increasing in K and converges to one as K ^ oo, because S — > S as — oo. If K = 2?2r — 3, the 
expected SINR is decreased by 3 dB. 

Although the result of [24] applies to the expected SINR, numerical results confirm that the results 
also apply to the outage scenario considered here. Therefore, a system using the sample covariance from 
K observations and SINR threshold /3 has, approximately, the same maximum density as a system with 
perfect CSI and SINR threshold p/ (l - ^). From the various bounds, we see that the maximum 
density depends on the SINR threshold as Thus, using the /^-observation sample covariances 

instead of the true covariance reduces the density, approximately, by a factor of (l — "^i^l Y^"- If we 
choose K = 2^^ — 3 then the loss factor is 2^/" for all 77.^; therefore, by appropriately scaling K linearly 
with rifj performance within a constant factor of the perfect CSI benchmark is achieved. 

To solidify these conclusions, in Fig. |7] the maximum density is plotted versus K for a 6 antenna system 
with SNR = 10 dB. The curves correspond to perfect knowledge of ho, estimation of ho on the basis of 
two interference-free pilots (each at 10 dB), and the approximation of the perfect CSI density (0.41 in this 
case) multiplied by (l — ^^q^)^^"- From the figure we see that estimation of ho does not significantly 
reduce density, the approximate density expression is reasonably accurate, and that choosing K on the 
order of 10 or 20 leads to a density reasonable close to the perfect CSI benchmark. Indeed, even if such 
estimation must be performed for every transmission, the overhead is reasonable in light of the fact that 
packets are typically on the order of hundreds of symbols. 



B. End-to-End Throughput 

The transmission capacity quantifies per-hop performance, whereas end-to-end throughput depends on 
the range and rate of each transmission and the spatial intensity of such transmissions. Thus, a legitimate 
question is whether the linear scaling of transmission capacity translates into linear scaling of end-to-end 
throughput? In the context of the transport capacity [8], which is a widely accepted end-to-end metric, 
this question can be answered in the affirmative. Transport capacity is the product of rate and distance 
summed over all transmissions and thus is proportional to \^{\ — e)(ilog2(l + (3), i.e., the product of the 
successful transmission density, per-hop distance, and per-hop rate. Since the transport capacity is linearly 
proportional to A^, the linear density scaling established in Theorems 1 and 2 also translates into linear 
scaling of the transport capacity. 

Although increasing the density with leads to linear scaling of the transport capacity, it is not a priori 
clear if the antennas should instead be used to increase the transmission rate and/or range. Based on the 
scaling results in Theorems 1 and 2, we see that \^d^j3'^/'^ oc Tipj. Thus, if the density and rate (i.e., SINR 
threshold 13) are kept constant, then the range can be increased at order d oc -^/n^. Alternatively the SINR 
threshold can be increased at order /? oc np t, which translates to increasing per-hop rate approximately as 
|log2(l + nn). Because transport capacity is proportional to \^d\og2{l + /3), using the receive antennas 
to increase per-hop range only increases transport capacity at order ^/n^ while increasing per-hop rate 
leads to an even poorer logarithimic increase (consistent with [13]). Therefore, the most efficient use of 
the receive array, from an end-to-end perspective, is to increase the density of simultaneous transmissions 
rather than the per-transmission rate or distance. 
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Fig. 7. Maximum density versus K, number of interference observations, for e = .1, /3 = 1, a = 3, d = 1, and SNR = 10 dB. 

These points can be argued concretely within the framework of expected forward progress (EFP), a 
metric introduced in [10] that is defined as 



where \ = vph the density of transmitters subject to an ALOHA protocol where the entirety of nodes in 
the network (that have density u) transmit with probability p and act as receivers (and hence relays) with 
probability 1 —p, where p is a design parameter that is optimized offline. An opportunistic routing protocol 
is employed through which the successful receiving node (i.e., relay) offering the most geographic progress 
towards a defined destination direction is selected to forward each transmitter's packet, and the quantity 
E[Xo] is the expected progress offered by such relay. The expected distance is inversely proportional 
to the transmitter density p, and by understanding how the optimum p changes with respect to tir we 
can determine if receive antennas are more effectively used for increasing transmit density {p increasing 
rapidly with n^) or for increasing per-hop range {p approximately constant). 

Fig. [8] contains plots of EFP versus transmission probability p for = 1 to = 8 and the optimum 
value of p is seen to increase approximately linearly with tir, confirming that it is more effective to use the 
antennas to increase density. (Closer examination shows that the expected distance per communication, 
i.e. the per-hop range, is approximately constant with respect to tZr for the optimizing value p.) The plot 
also illustrates that the EFP itself is linear with n^, confirming that linear scaling holds for multihop 
wireless networks. To see that increasing the rate rather than density also is suboptimal, if p is kept fixed 
to the small value of 0.075 when tir = 8 and the spectral efficiency log2(l + /?) is set to 4 (this value 
leads to the same expected per-hop range as the optimizing p), then the resulting EFP is 0.3 instead of 
the 0.4 achievable if rate is kept fixed and p (i.e. the density) is increased. 

C. Effect of Interference Geometry 

Finally, we consider the effect of the interference geometry. In this paper we have assumed a homoge- 
neous Poisson distribution for the node locations, which is a realistic model if the users take up random 



EFP = i/p-E[Xo]-log2(l + /?), 



(38) 
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Fig. 8. Expected forward progress vs. transmission probability (p) for tir ranging from 1 (bottom) to 8 (top), for a = 3, P — dB, and 
no noise. The optimizing values of p are denoted with an X. 



locations and do not coordinate their transmissions. One might reasonably wonder, however, if the linear 
scaling result is an artifact of this model, since in a Poisson field the nearest interferers dominate and 
so interference cancellation might be far more profitable in this setup than in a more regular network. 
A "good" MAC protocol would seemingly space out the active transmitters at any instance, to avoid 
dominant interference. As a simple manifestation of such a MAC, we consider a regular network where 
the interferers take up positions on a square grid with edges of length 1 / -\/A. From Fig. [9] we see that a 
regular network allows for a larger density of simultaneous transmissions, but only by a constant factor that 
is independent of n^. Based on this, we conjecture that the linear scaling result holds for any reasonable 
network geometry. 

V. Conclusion 

The main takeaway of this paper is that very large throughput gains can be achieved in ad hoc networks 
using only receive antennas in conjunction with linear processing. In a point-to-point link receive antennas 
only provide array gain, which translates into a linear SNR and thus logarithimic rate increase (in the 
number of receive antennas). In an ad hoc network, however, receive antennas can also be used to 
cancel interference and this possibility turns out to yield much more significant benefits. In particular, 
the main result of the paper showed that using receive antennas to cancel interference and obtain some 
array gain allows the density of simultaneous transmissions to be increased linearly with the number of 
receive antennas when nodes transmit using only a single antenna. This result only requires channel state 
information at the receiver, which can be reasonably estimated, and the conclusion was seen to be robust 
to the particular interferer geometry. 

From an end-to-end perspective, this linear increase in the density of simultaneous transmissions 
naturally translates into a linear increase in network-wide throughput. In addition, our analysis showed that 
receive antennas are in fact best utilized by increasing the density of simultaneous transmissions rather 
than increasing the per-hop rate or range. Finally, although the single-transmit/multiple-receive antenna 
setting may appear artificial, subsequent work on this model has shown that it is actually detrimental to 
employ multiple transmit antennas when channel state information is not available to the transmitter [25]. 
As a result, the single transmit/multiple receive antenna setting is indeed very relevant. 
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Fig. 9. The optimal density for both Regular and Poisson networks increases linearly with nn, which regular networks having a slightly 
higher achievable density since nearby (dominant) interferers do not exist. 



Appendix I 
Proof of Lemma [T] 

By the definition of vq, the quantity Ivjhop is the squared-norm of the projection of vector ho on 
Null(hi, . . . , hfc). This nullspace is tt-r — A; dimensional (with probability one) by basic properties of iid 
Gaussian vectors and is independent of hg by the independence of the channel vectors, and thus Ivjhop 
is X2{nR-fc) P^]- "^^^ second property holds by the definition of the PZF-fc receiver. To prove the third 
property, note that vq depends only on ho, hi, . . . , h^ and thus is independent of h^+i, hk+2, ■ ■ ■■ Because 
the distribution of each channel vector is rotationally invariant (i.e., the distributions of Whj and hj are 
the same for any unitary matrix W), we can perform a change of basis such that vo = [1 ■ ■ ■ 0]^. 
After this change of basis, each vjhj (for z > A; + 1) is simply equal to the first component of hj. As 
a result vjhfc+i, vJhfe+2, • • • are iid complex Gaussians; thus the squared norms are iid exponentials, and 
furthermore these terms are independent of S. 



First we have 



E 



Appendix II 
Proof of Lemma |2] 



(39) 



=k+l 



where E [|X,;|-°iJi] = E E [Hi] = E due to the independence of \Xi\ and Hi and the fact 



that E[Hi 



1. Because iXif, X2 



are a 1-D PPP with intensity vrA, random variable 7rA|Xj|'' is 
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xli and thus has PDF f{x) 



^'"'^ Therefore 



(i-l)! 



E 



-a/2 



poo 

r(i-f) 



dx 



(40) 
(41) 



This quantity is finite only for i > |, and thus the expected power from the nearest uncancelled interferer 
is finite only if A; + 1 > |. 

To reach the upper bound, we use the following inequality from [12] (which is derived using Kershaw's 
inequality to the gamma function): 



r(z-f) 



< U 



where [■] is the ceiling function and we require i > [|] . Therefore 

OD 



i=k+l 



TU) 



< 



k+l 

oo 

k 

a 

2 ~ 



X 



dx 



(42) 

(43) 

(44) 
(45) 



where the inequality in the second line holds because a; 2 is a decreasing function. 



Appendix III 
Proof of Theorem [2] 

The outage upper bound is obtained by keeping the interference contribution of only the nearest / 



uncancelled interferers in ( [24| ) and applying Markov's inequality to the success probability: 

>f3 



(a) 



1 - p^z'^ix) < p 



< P 



< P 





hol 


2 




Xi\- 

l|ho 


2 



E 



j=nR 



>/3 



rf-"||ho| 



W 1 
< - E 



-a Y^"R-1+' 
^-a|IV,J|2 



Hi 



>/3 



|ho| 



\X, 



(46) 
(47) 
(48) 
(49) 



where (a) follows from (24), (b) because decreasing the interference increases the SIR and thus the success 
probability, (c) because |Xj| are increasing in i and the function (■)^" is decreasing, and (d) is due to 
Markov's inequality. By the independence of the various random variables: 



E 



d-°||ho| 



I Y 



nn-l+l\ 



E[||ho||']E 



,^/2 r (riR - 1 + / + f ) 
r (riR - 1 + /) 



(50) 
(51) 
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where we have used the fact that ||ho|p ~ xLr' Yl^=n^ ^ Hi is the sum of / iid exponentials and thus is 

Xl, and - 1 + /|2 ~ ±.xl^^^_^^^y 

By applying Kershaw's inequality, which states T {x + 1) /T {x + s) < (^x — ^ + ^/s~^^lJ4^ Wx > 
and < s < 1, and the property r(a; + 1) = xT(x), we have: 

^ < (^r-1 + / + 7t) • (52) 



r (^R, - 1 + /) - V " 2 



Substituting (|5T| and (|52]) into (|49]) yields: 

P^A) > 1 - (^A)-"/^ (nR - 1 + / + 1)"^' (53) 

By choosing / = tIr + 1 we obtain the desired outage probability lower bound, and by setting this bound 
to e and solving we get the density upper bound. 

Appendix IV 
Proof of Theorem [3] 

Write / for If The variance of a sum of rvs may be expressed as the sum of the covariances: 



oo oo 



Var(/) = {nd'XrYl E (^~^^- ^"^^i) 

i=k+l j=k+l 
oo oo 

i=k+l j=k+l 



oo oo 

a a 

2 



= (nd'xri J2 {2E[Tr]-nTr'^?)+2 J2 E {nmT,)-n-E[Tr^E[T: 

\i=k+l i=k+lj=i+l 

(oo oo oo \ 

E (e[7;-"] + Var(T-^)) +2 ^ E (^^^'^"^) • ^^4) 
i=k+l i=k+lj=i+l J 

Recall the Cauchy-Scharz inequality applied to the covariance of two rvs gives 

Cov(X, Y) < v/Var(X)Var(F). (55) 

Applied here: 



oo oo 



Var(/) < (vrrf^A)" E (^[^""]+Var(7;"^))+2 E E V (^"^) ^^^^^ 



=k+l i=k+l j=i+l 

oo oo 



= {TTd'Xr E (E[T;^1+Var(7;"^)) +2 E V (rr^) E A^K^' ^) V^^^ 

\i=k+l i=k+l ' j=i+l J 

We apply the Chebychev inequality as follows. Condition on all possible values for S, fix a parameter 
a > /9 (E[J] + gj^), and upper bound the tail probability on / by one for all s < a, then apply the 
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Chebychev bound on / — E[/] for each s > a: 

P(SINRP^f-*^ < (3) 



ppzf— fc 
-"out 



p I / > 



i - Mr) ^^(^^^^ 



p / > 



< f{S<a)+ P /> 



P SNR 



fsis)ds 



= F{S<a) + I^I^I- E[I] > ^ - ^ - E[I]^ fs{s)ds 
< F{S<a)+ rF(\I-E[I]\>^-^-E[I]]fs{s)ds 



< F{S<a) + 



Var(J) 



/3 SNR 

POO 



-E[I] 



p SNR 
jfs{s)ds 



F{S <a)+ /3Var(/) 



jfsis)ds 



(57) 



Viewed as a function of cr, this outage probability upper bound is a convex function in a with derivative 



d 

da 



5pzf— fc,ub 



(a) = fsia) - P'Varil) 



fsicr) 



cr — 



SNR 



■ pE[i]y 



Equating this to zero and solving for a gives 

Substituting this value of a into the upper bound gives the theorem. 

Appendix V 
Proof of Theorem g] 



(58) 



(59) 



Write / = /„j, . Then, 



-| pmmse 

out 



< P 

< P 
< 



Var (f ) 



>P-E 



The independence of S, I yields: 

'S 



E 



E\S]E 



, Var 



E[S^]E 



■ 1 
P 



-E[S]'^E 



(60) 



(61) 



We know S ~ ^xlnR thus E[S] = tir and EfS*^] = nRiriR + 1). It remains to find a lower bound on 
E []] and an upper bound on E [-^] . Jensen's inequality gives a lower bound on E [j] since the function 
1/x is convex: 



E 



> 



w] {^d-\r^YZn,nT-"'\ 



(62) 
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Consequently, using E[S] = ur. 



E[S]'^E 



> 



n 



R 



(63) 



By Lemma [6| the upper bound on E [-^] is: 

(7rrf2A)-"e2T 



E 



1 
P 



< 



Thus 



Var 



nniuR + 1) 



S\ ^ riRijiR + 1) n| 

" (-^^^)" (Er.„.e-(^-t.o«))^ - ^^^^^^^ ^^^^^ r(^y 



(64) 



(65) 



which completes the proof. 

Lemma 4: The function fa,bix, y) = ^iVi) with parameters a, 6 is a concave function in x, y for 
all a, 6 such that a + 6 G [— |, O] . 

The proof of the lemma is as follows. View fa,b{x,y) as a composition of functions 



fa,b{^,y) = h{ga,b{x,y)), h{g) = \, ga,b{x,y) = Vx^^y^ 

^ i 

Note h is convex decreasing with derivatives 



(66) 



(67) 



while g has derivatives: 



0^ = axr'y'l 



dyi 



6-1 



dy: 
d'^9a,bix,y) 
dyiyj 



dxidyi 
d^ga,b{x,y) 
dxiVj 



(68) 



Using 

dfa,bix,y) 
dzi 



h'{ga,b{x,y)) ' — , Zi G {xi,yi} 



dzidzj 



dzi 



dzj 



dziZj 



we compute the quadratic form as {x,y)\/'^ fa,bix,y){x,y) 



d^fa,b{x,y) d'^fa,b{x,y) ^d^fa,b{x,y) ^ d"^ fa,b{x,y) 



(9X7 



d^fa,b{x,y) d^fa,b{x,y) ^d^fa,b{x,y) , ^d^fa,b{x,y) 

— —y^ + + + — ^ — 



^ dyf ' 
2(a + 6)(l + 2(a + 6)) 

9a,b{.^^yY 



dyiXj 



(70) 



The sign of the quadratic form depends upon a, h only through the sum a + h. In particular, the quadratic 
form is negative for all a, h such that a + 6 G [— |, O] , and thus fa,bix, y) is concave for these values. 
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Lemma 5: For any a, h with a + h E [— ^, O] and h < —1: 



E 



1 
P 



< 



The proof of the lemma is as follows. Define Xj = and Yi = . Then: 



E 



t; 



E 



2 • 



(71) 



E 
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P 



E 



E 



E 



a, / i\b 



(7rrf2A)-E[/,,,(X,F)]. 



The proof follows by Jensen's inequality and Lemma |4} The constraint h < —1 follows from: 



E[if" 



r(i + c), c>-i 



oo, 



else 



(72) 
(73) 



Thus to ensure E 



H} 



< oo we must have 1/6 > —1, or 6 < — 1. 



Lemma 6: An upper bound on E[l//^] is 



E 



1 

P 



< 



(7rc/2A)-"e2T 



{TTd^Xy 



2 • 



(74) 



where 7 is the Euler-Mascheroni constant and tpoi^) is the poly-Gamma function. 
The proof of the lemma is as follows. Using 



E 



rn 2a _ I 2a) 

Til) 



E 



H} 



V I 



1 



(75) 



and Lemma |5] we pose the following optimization problem (to make the bound as tight as possible): 



max < r 1 , , 



b 00 
i=na 



Til) 



b< -1, a + b> --, a + b<Oy 



(76) 



For any b the optimal choice of a is such that a + b = —1/2 and for this (or any) choice of a the objective 
is unbounded as 6 ^ —00. Consider the limit as 6 — > —00 and a = —b — \. The proof follows from the 
facts that 



Hm r 1 + - 

6^-00 \ b 



and 



lim 



r z- 



2(-6-l) 



Til) 



(77) 



(78) 



where 7 is the Euler-Mascheroni constant and V^o(0 the zero-order poly-gamma function. 
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